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ABSTRACT 

One of the objectives of a research effort at the Naval 
Postgraduate School is the realization of a "simulation pro- 
gram generating system" to produce executable computer pro- 
grams from natural language descriptions of simulation 
problems. The basic part of the system being developed is 
a FORTRAN program for translating input text into a repre- 
sentative data structure and constructing output text from 
the data structure. Previous work has resulted in the 'capa- 
bility to translate the data structure for a simulation 
problem into an English text description and a GPSS computer 
program. The purpose of this thesis was to supplement rhe 
above with an interactive question-answer scheme that gen- 
erates the simulation problem data structure. The resulting 
GPSS "simulation program generating system" is capable of 
handling a variety of simple queuing problems. 
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I. INTRODUCTION 



System analysis through computer simulation is widely 
recognized as a valuable tool for optimizing system perfor- 
mance and reducing operating cost. The process of creating 
a usable computer simulation model ultimately results in 
the need to transform a conceptual model of the system 
being studied into a suitable computer program. Creating 
this program requires specialized knowledge in computer 
simulation techniques and programming. A system which could 
automate this task would significantly reduce the time and 
cost of producing a computerized model. 

A. PROGRAM GENERATING SYSTEMS 

Higher-level computer programming systems such as 
FORTRAN, ALGOL, or ' PL/1 may be loosely classified as "pro- 
gram generating systems." In this context, the programmer 
writes statements that comply with the syntax and semantics 
of the language. A compiler then generates an executable 
machine language program from the statements provided. 

In this thesis, however, "program generating system" or 
"program generator" implies a method of obtaining a usable 
computer program (possibly in a higher-level programming 
language) without having to write statements in a program- 
ming language. In a system of this type, the problem to be 
programmed might be stated by filling out a questionnaire, 
where the answers could be key-punched on computer cards. 
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as input data for the program generator, which would then 
produce the desired computer program. Reference 1 des- 
cribes such a system developed at The RAND Corporation. 

The RAND system employs decision tables and a statement 
list of computer commands in the output language. The deci- 
sion tables specify the command or series of commands re- 
quired from the statement list as a function of the choices 
selected on the questionnaire. Reference 2 describes in 
detail a specific application of this system to "job shop" 
simulation programming. 

Another type of program generating system could be one 
which accepts as input data an English language statement of 
the problem to be programmed. This presents the problem 
of translating English language into a usable computer pro- 
gram, or into a computer data structure which adequately 
represents the information in the English text. This task 
falls into the realm of "natural language processing." 

B. NATURAL LANGUAGE PROCESSING 

Reference 3 reviews much of the recent work done in the 
field of natural language processing. It discusses, pri- 
marily, systems which (1) answer questions, (2) interactive- 
ly engage the user in apparently intelligent conversation, 
or (3) solve word-problems. Natural language processing 
involves, in some form, a method of transforming or decoding 
the English text into a computer data structure and a sys- 
tem which analyzes the data structure, and encodes an 
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appropriate output. A "natural language program generator" 
would, of course, produce a computer program as the final 
output . 

At the Naval Postgraduate School, an interactive system 
has been developed to perform the decoding and encoding pro 
cesses mentioned above. This system, called NLP (Natural 
Language Processor) is capable of translating input text 
into a data structure by following "decoding rules," and 
then analyzing the data structure and producing output by 
following "encoding rules." NLP is presently configured 
to operate on the IBM 36O/67-CP/CMS time-sharing system at 
the Naval Postgraduate School. V/ith minor modification, 
however, NLP could operate on most time-sharing systems 
with a FORTRAN IV capability. 

C . BACKGROUND 

Current research with NLP i.s directed toward the pro- 
blem of decoding a natural language description of a compu- 
ter simulation problem and generating (encoding) an 
appropriate computer program to perform the simulation 
[Ref. 1 !]. Decoding rules have been written to translate sim 
pie English sentences into a data structure of linked re- 
cords. These records contain "attributes" and "indicators" 
which describe the content and meaning of a sentence. Cor- 
responding encoding rules have been written to process the 
data structure and encode sentences according to the 
attributes and indicators of the records. 
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An appropriate internal data structure (IDS) for simple 
queuing problems and a set of NLP encoding rules, called 
GES (GPSS Encoding System), that produce computer programs 
in the GPSS simulation language, have also been developed, 
as reported by Hansen [Ref. 5]. In addition, McGee [Ref. 6] 
describes the development of XGES, an expanded and enhanced 
version of GES, and the development of a set of encoding 
rules which produce, from the IDS, a natural language state- 
ment of the simulation problem as "understood" by the 
computer. 

D. OBJECTIVE ' 

The objective of the research reported on in this thesis 
was to design and implement, within the framework of NLP, 
a procedure for developing simulation programs by interactive 
quest ion-answering at a computer time-sharing terminal. 

The result of this work is a set of decoding rules and the 
FORTRAN subprograms needed for NLP to function as a "pro- 
gram generating system" for simple queuing simulation pro- 
blems. This research has also provided some basic techniques 
and background data that can be used in the development of 
the full natural language processing capability. 

E. ORGANIZATION OF THESIS 

The remainder of the thesis is divided into four sec- 
tions. Section II describes the relevant portions of NLP, 
in particular the decoding process. Section III covers the 
question-answer scheme and the FORTRAN control program. 
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The decoding rules and some decoding techniques are dis- 
cussed in Section IV. Section V presents a specific simu- 
lation problem, recommended user preparation for operating 
the question-answer system, the terminal session, and the 
encoded English text description and GPSS program. Conclu- 
sions and recommendations for further work are contained in 
Section VI. 



II. OUTLINE OF NLP AND DECODING 

NLP may be thought of as a two-part system, the NLP 
Compiler and the NLP Machine. Both parts are intimately 
related to a list-structured, record-based memory. This 
section contains a discussion of the memory, the Compiler, 
the Machine, and NLP rules for decoding and encoding. 

A. NLP MEMORY STRUCTURE AND USE 
1. The Cell 

The basic unit of memory in NLP is the "cell." 

Each cell consists of eight bytes (64 bits) of IBM 360/67 
computer memory. The cells are sequential elements of a 
FORTRAN array (named CELL) and are addressed accordingly. 

A cell may be used as an eight byte word or as a group of 
four 2-byte words . When used as an eight byte word a cell 
could represent eight characters in EBCDIC code, a numeric 
value, or a set of 64 on-off indicators. In the four 2- 
byte word format the "rightmost" word always contains either 
a zero or a pointer to (the address of) another cell. The 
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remaining three words contain data which are normally numer- 
ical values, cell addresses, or on-off indicators. 

2 . Records 

A record is a list of cells, and is used to repre- 
sent an "entity" by holding the values of its "attributes." 

For example, a record could represent the punctua- 
tion mark or the object "Red’ Brick; hardware store; 

1125 Market St., Dayton, Ohio; value $50,000; owner J.M. 

Doe; For sale," or the sentence "The men gave the small 
boys money." 

The number of cells in a record depends pn the num- 
ber of attributes specified. The. values of attributes may 
be actual numerical values or pointers to other records. 

A special kind of attribute is the indicator (switch value) 
which, when "turned on," specifies the existence of a 
particular condition. 

For example, a record representing the tires on a 
vehicle could have a "quantity" attribute with a value of 
4, and a "color" attribute with a value of 'black.' The 
record could also have a "whitewall" indicator to specify 
that the tires have white sidewalls, and possibly a "tube- 
less" indicator to specify that they are tubeless. This 
record could also have a "superset" (SUP) attribute which 
would indicate that the entities referred to by the record 
are members of a class of objects called "tire." This re- 
cord may be input to NLP in the following manner: 
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VEHTIRE(SUP= r TIRE ' , COLOR= 1 BLACK 1 , QUANTITY*^, 

VJHITWALL, TUBELESS) 

where WHITWALL and TUBELESS would have been previously de- 
fined as indicators. Other records, representing trucks 
or automobiles, could point to this record as the value of 
their "support" attribute. Reference 5 contains a detailed 
description of the cell and record structure and the manner 
in which attributes and indicators exist and are referred 
to within a record. 

In general there are three classes of records: 

a. Permanent records 

Most permanent records are produced by the NLP 
compiler during initialization of NLP. These records repre- 
sent items such as decoding rules, encoding rules, objects, 
attribute names, indicator names, words, etc. Permanent 
records establish a "configuration" of the NLP Machine. 
Although these records are called permanent, there is a 
capability for adding or deleting certain of them in the 
NLP command mode. 

b. Semi-permanent records 

These records are the end result of the decoding 
process. Some form of each semi-permanent record exists in 
the IDS that is ultimately used to encode the output. Semi- 
permanent records are usually deleted if the user chooses 
to re-run the NLP Machine using the same or any other 
configuration . 



12 



c. Transient records 



Transient records exist only for the purpose of 
creating other records, or for modifying or adding to the 
structure of other records. Numerous transient records are 
created and deleted during the decoding and encoding pro- 
cesses. The term "SEGMENT" record is used in Ref. 5 to 
refer to transient records. 

One unique permanent record, called MEMORY, is 
the coordination center for the NLP Machine. MEMORY is 
always directly accessible from both the FORTRAN control 
program and the decoding and encoding rules. MEMORY w as 
used extensively in this project to communicate between 
the questionnaire control program and the decoding rules. 
MEMORY also serves as the starting point for GE3 and XGES , 
and the questionnaire. The many uses of MEMORY will 
become apparent in later sections. 



B. NLP COMMAND MODE AND COMPILER 

The execution of NLP begins in the command mode . The 
first user action is to initialize the system from external 
data sources. External card image files contain rules writ 
ten in NLP "rule language" format, records that prescribe 
the initial IDS, and optional lists of attribute and indica 
tor name definitions. These files define a particular NLP 
Machine starting configuration. When the user specifies a 
file to be processed, the NLP Compiler translates the input 
file into an internal record representation of the informa- 
tion in the file. After each file is processed, the user 



13 



can cause the current NLP record structure to be dumped onto 
another external file. This greatly reduces future pre- 
processing time when the user desires to re— initialize NLP 
to a particular partial or complete configuration. Other 
commands available to the user include entering a new file, 
deleting a rule, defining new attribute names,, indicator 
names, and permanent records, printing out records 
specified by name or number, and initiating the NLP Machine. 

C. NLP MACHINE 

Running the NLP Machine is the process of decoding input 
text into the IDS or encoding output from the IDS. Refer- 
ences 5 and 6 describe the GES and XGES encoding systems, 
and specify an IDS for representing simple queuing problems. 
For testing GES and XGES, the IDS had to be produced manual- 
ly and entered from an external file via the NLP Compiler. 
The end-product of this thesis is a system for producing 
an encodable IDS from a time-sharing terminal via decoding 
rules and question answering. While operating the question- 
answer system, the user "runs" the NLP Machine. 

D . NLP RULES 

A set of rules (decoding and/or encoding) may be thought 
of as a "rule language" program for the NLP Machine. Each 
rule specifies a data structure transformation to be 
performed when certain conditions are met. 
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Encoding rules and decoding rules are similar in con- 
struction and effect, but are conceptually opposite in appli- 
cation, and are executed (internally) quite differently. 

In addition, they are processed separately by the NLP Com- 
piler and operate independently under the NLP Machine. The 
user can not simultaneously decode input and encode output, 
although the NLP Machine can be easily modified to automa- 
tically switch to encoding (from decoding) when a particular 
IDS condition is sensed. 

1 . Rule Construction and Meaning 

A rule consists of a left part and a right part 
separated by the transformation symbol. The data 

structure changes indicated in the right part of a rule are 
performed when the conditions of the left part; are 
satisfied. The following is a typical NLP rule: 

SEG1 ( ATTR1 . NE . 0 , IND1 , -,IND2 ) — > SEG2 ( ATTR1=0 , -IND1 , IND2 ) 

where SEG1 and SEG2 are names of SEGMENT records. When the 
rules are initially processed, the NLP Compiler creates a 
SEGMENT TYPE record for each SEGMENT record name found in 
the rules. One attribute of a SEGMENT TYPE record is a 
list of decoding rules which begin with a SEGMENT of that 
type, e.g.,"SEGl", on the left. "ATTRl" is an attribute 
name, and "INDl" and n IND2" are indicator names. (If IND1 
and IND2 were not initially defined as indicators they 
would be considered attributes when encountered in the rules, 
and processed accordingly.) The parenthesized items on the 
left are condition tests, and on the right are "labeling'' 
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specifications (certain types of condition tests are also 
permitted on the right).. The absence of one or both of the 
parenthesized expressions means no condition testing (other 
than the fact that the SEGMENT exists) and/or no labeling 
operations are required when the rule is executed. The 
example rule given above can be read: 

"If a SEGMENT record of type SEG1 exists with its 
ATTR1 attribute not equal (NE) to zero, its IND1 
indicator 'on', and its IND2 indicator 'off', then 
create a new SEGMENT record of type SEG2 and set 
its ATTR1 attribute to zero, its IND1 indicator 
to 'off', and its IND2 indicator to 'on'. 1 ' 

Hereafter, the term "SEG2" will be synonomous with the 
phrase "SEGMENT record of type SEG2 . " 

2 . Decoding Rules 

Each SEGMENT record created by the NLP Machine via 
the decoding rules represents the information found in a 
storing of input text. The relative indices of the first 
(from the left) and last characters covered by a SEGMENT 
record are stored as attributes of the record. This feature 
is employed in the following type of decoding rule to combine 
adjacent (reading from the left) elements of text: 

SEG1 SEG2 --> SEG3 

This rule will create a SEG3, itfhen a SEG1 and a SEG2 exist, 
that represent consecutive portions of the input character 
string. If the SEG1 covers character index positions l- 1 ! 
and SEG2 covers positions 5-8, then the SEG3 will cover 
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character positions 1-8. Rule language syntax requires 
that each element of a rule be separated by one or more 
blank spaces, and that each rule begin on a new card 
image line in the file. 

During initialization of NLP, a SEGMENT TYPE record 
is also created for each letter, digit, and punctuation 
mark. Thus it is possible to write NLP decoding rules 
which result in the creation of SEGMENT records which 
represent the meaning of a string of English text. For 
example, the text: 

BLAC.K CAT 

could be processed by the following decoding rules to create 
a NOUNPH which would cover the text from the B to the T and 
’would represent the information "a four-legged, black animal 
in the cat family": 

BLACK — > ADJ( ’BLACK’, COLORIND) 

CAT --> NOUN ( ' CAT - , ANIMAL, LEGS =4) 

ADJ( COLORIND) # NOUN --> NOUNPH ( J&NOUN, C0L0R=SUP (ADJ) ) 

In the above rules, on the right, the notation 
’BLACK’ tells the NLP Machine to set the SUP (superset) 
attribute of the ADJ record to point to the record named 
BLACK. Records with names are referred to as "named re- 
cords." The named record for BLACK could have a SUP attri- 
bute pointing to the record named ABSCOLOR, etc. Also, the 
notation %NOUN means copy all information from the NOUN 
record into this record; and the if on the left indicates 
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the presence of a blank space in the input string. Appen- 
dix B contains additional information on the syntax and 
semantics of the decoding rules. 

III. THE QUESTION- ANSWER SCHEME 

The system developed to meet the objective of this the- 
sis consists of a FORTRAN control program to interact with 
the user, and a set of NLP decoding rules to interpret user- 
responses and generate an encodable IDS for GES . Each ques- 
tionnaire session is conducted in three phases, and is 
initiated by the NLP command "QUESTION:" entered at the 
terminal. The first part of this section is a description 
of the general structure of the quest ion-answer scheme. 

Then each of the three phases of processing are discussed. 
The emphasis is on the operation of the control program 
QUEST, a listing of which appears in Appendix D. 

A. GENERAL STRUCTURE AND FORMAT 

During a question-answer session, many of the computer 
produced questions and instructions to the user contain data 
(usually attribute or identification information) provided 
in previous user responses. Also, the nature of a user 
response often determines the next computer question or 
instruction. Certain user responses may be entered in 
several different forms because the basic decoding scheme 
scans the input string for the word, phrase, or number 
values which are appropriate. For example, when th user 
is prompted with: 
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’ NAME NEXT STATIONARY ENTITY OR TYPE "NONE"' 



any single word response that was not previously defined, 
or was defined as a stationary entity name (not previously 
used), villi be decoded as a valid name for a new stationary 
entity. The subsequent question will then pertain to that 
stationary entity. The "NONE" part of the above prompt, 
however, would be satisfied with any of the following 
responses : 

N 

NO 

NONE 

DONE 

NO MORE 

NO MORE STATIONARY ENTITIES 

FINISHED 

END 

In this case the decoded response (a "done" indication) 
implies that this phase is complete. Any response that 
does not constitute a valid stationary entity name or "done" 
indication would result in an error message sequence being 
typed at the terminal by the computer. 

An error message sequence consists of a reprint of the 
previous input string with a $ under the last character 
absorbed by the decoding rules, and (in this case) one of 
the following messages: 

(a) ’ ** INVALID STATIONARY ENTITY NAME' 

(b) ' ^INCOMPLETE REPLY AT $... RE-ENTER ' 
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After an error indication the user is prompted with the 
same (or a similar) question or instruction. The error 
detection scheme is discussed in a later' section . 

In QUEST, each occurrence of the FORTRAN statement 
"CALL DECODE(ONE, &305) " results in rhe terminal unlocking 
for a user response, and the subsequent text decoding (via 
the rules) being initiated. If decoding is successful 
(a REPLY is found) and/or the input string is exhausted, 
a return to QUEST is effected. At this point the logical 
FORTRAN variables OK (OK=true means "valid response detect- 
ed") and DONE (DONE=true means a valid "done" response de- 
tected) may be examined to determine subsequent action. 

DONE = true implies OK=true, hence a DONE check (when appli 
cable) always precedes an OK check. OK=false always re- 
sults in the general error message (b) above, if a specific 
error message sequence was not initiated from the decoding 
rules, and a user prompt requesting the same entry. 0K= 
true causes the computer to prompt the user with the next 
question or instruction.. DONE=true (when applicable) is 
used to indicate the termination of a data entry phase or 
cyclic sequence and causes the next portion of the 
questionnaire to be initiated. 

One control feature of QUEST is the "checkpoint" (indi- 
cated by the parameter: CHKPNT) . This signals the decoding 
system to enable certain additional decoding rules. The 
FORTRAN statement: 



20 



CALL HSTORE ( ONE, CHKPNT, MEMORY) 
causes the numeric value 1 to be assigned to the CHKPOINT 
attribute of the IDS record MEMORY. This value can then 
be accessed from the decoding rules. For example, the 
rule : 

SEG1 ( CHKPOINT ( MEMORY ) . EQ . 1) — > SEG 2 
would create a SEG2 only when a SEG1 exists and the CHKPOINT 
attribute of MEMORY has the value 1. 

Additional format and structure features are explained 
in the following sections. 

B. PHASE 1: STATIONARY ENTITIES 

A stationary entity in a simulation problem represents 
any physical element of the system being simulated which 
does not change in position or usage during the simulation 
problem. A stationary entity usually represents a point of 
contact, such as a processing or holding point, between the 
system and one or more mobile entities which pass through 
the system. Examples of stationary entities are a gas pump 
in a service station, a pier in a harbor, a bank, and a teller 
window in a bank. Phase 1 is the process of identifying 
and labeling stationary entities. 

Each stationary entity is represented in the IDS by a 
record. All stationary entity records except for the one re- 
presenting the system being simulated have, as a minimum, 
an identification number (IDNO), a SUP attribute, and a 
CAPACITY attribute. In addition, they may have an arbitrary 
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number of "optional" attributes, such as LOCATION, COLOR, 
etc. Figure 1 shows the functional question-answer sequence 
from the start of QUEST through complete definition of all 
stationary entities. Each action (defined in Section III-D) 
is considered to occur at or within a stationary entity. 
Therefore, each action has a LOCATION attribute that points 
to a stationary entity record. Stationary entity records 
contain the information needed by GES to create the 
corresponding GPSS Storage or Facility. 

C. PHASE 2: MOBILE ENTITIES 

A mobile entity in a simulation problem represents the 
dynamic element that is processed or serviced by the system 
being simulated. Mobile entities can be thought of as 
entering, being processed in one or more stages, and leaving 
the system. In a GPSS program, mobile entities are repre- 
sented by "transactions." Examples of mobile entities are 
a customer in a bank or store, an automobile in a gas sta- 
tion, and a ship in a harbor. Phase 2 is the process of 
identifying and labeling mobile entities. 

The question-answer decoding system creates IDS records 
for mobile entities. These records contain the data need- 
ed by GES to produce a GPSS program that generates repre- 
sentative transactions. Every mobile entity record con- 
tains, .as a minimum, SUP and IDNO attributes. The question- 
answer system allows the user to define a maximum of one 
level of sub-structure for each different (by SUP attribute) 
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Figure 1. Stationary Entity Entry Sequence 
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mobile entity. If a mobile entity record has a sub- 
structure, it also has a classification (CLASATR) attribute 
which points to the attribute that delineates the sub- 
structure. Each sub-structure record has a different value 
for that attribute, plus a STRUC attribute that points to 
its super-structure record. Each sub-structure record also 
has a QUANTITY attribute specifying the percentage of mobile 
entities of that particular type that belong to this sub- 
classification. Figure 2 gives an example of this 
relationship. 



' REC1 ' 




Figure 2. Sub-Structure Relationship 

The symbol in Figure 2 means the attribute on the left 

points to (contains the address of) an IDS record which is 
used to represent the data on the right. The "=" symbol 
means the attribute on the left contains the actual value on 
the right. The CONSUMP attribute of REC1 indicates the num- 
ber of service units each 'SHIP' requires at a processing 
point. Figure 3 shows, functionally, the question-answer 
sequence for creating mobile entity records. 
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D. PHASE 3: ACTION SEQUENCE 



There are two types of actions that can occur in a 
simulation problem, an event or an activity. An event is 
an action that does not consume time within the simulated 
system. A car arrives at a gas station, or a customer 
leaves the store, are typical examples of events. Activi- 
ties, on the other hand, are actions which consume simulated 
time, such as: a car is serviced at the gas pump, or a blue 
ship unloads cargo at the dock. Records representing actions 
and their interrelationships form the heart of the structure 
from which GES encodes a GPSS program. Phase 3 is the 
process of developing this structure. 

Every action record for an activity contains the fol- 
lowing essential attributes: SUP (the name of the activity), 
SUCC (pointer to the "successor" action record), MBNTY 
(pointer to the record for the mobile entity involved in 
this activity), LOCATION (pointer to the stationary entity 
where the activity takes place), and DURATION (describes 
the activity time duration, usually via a probability dis- 
tribution). Events may or may not appear as action records. 
The arrival and departure of mobile entities are events 
which are represented by action records. In this case, all 
attributes listed above for activities, except DURATION, 
are required. The action record for arrivals must, however, 
contain an inter-arrival time (IETM) attribute. An event 
not explicitly represented by an action record is typically 
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1 




Figure 3. Mobile Entity Sequence 
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f 

one which yields a GPSS Function and TRANSFER block, and 
appears as the SUCC attribute of an action record. 

Developing a question-answer decoding procedure that 
would produce an encodable IDS for the action sequence 
proved to be the most difficult part of this research. 

The procedure chosen is closely related to the action list 
(an IDS record which is a list of pointers to the action 
records). To initiate the sequence, an action record (ARRIV) 
for the arrival of the first-entered mobile entity is auto- 
matically generated by. the decoding rules. The user is 
instructed to specify an inter-arrival time probability 
distribution for all mobile entities and a basic time unit 
for the simulation. The decoding rules enter the ARRIV 
record on the action list without a SUCC (successor) attri- 
bute. The basic action-builder loop starts with the de- 
coding rules picking up the first action (from the action 
list) that does not have a SUCC attribute and is not a 
LEAVE event. The user is then prompted to enter information 
concerning the next action. The result will be one or 
more new action records, each without a SUCC attribute, 
and each entered on the action list. In the process, the 
action record which started the cycle w ill receive a SUCC 
attribute, and the stage is set for the next cycle. In 
essence, the question the user must answer is: 

"What does the mobile entity <name 1> do after the action 
<act 1> is complete?" 
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If the response sequence results in a single new action 
record, the cycle simply repeats. However, the user might 
wish to say: 

"After <act 1> , 30 % of the <name l>'s go to <act 2> , 

. 40% go to <act 3> , and 30% go to <act 4>." 

This response would be entered as a. "conditional" (dis- 
cussed later). The problem here is that <act 2>, <act 3>, 
and <act 4> would not have been defined or identified yet, 
and the action just specified is an event that does not 
yield an action record. To solve this problem, the concept 
of a "temporary action" (TEMPACTN) was developed. A TEMPACTN 
record is created by the decoding rules in every case where 
an action must be defined or identified later. Each TEMPACTN 
record is added to the action list without a SUCC attribute, 
and is filled with reference information. After a TEMPACTN 
is processed by the action-builder, it is always deleted 
from the IDS, and its position in the action list may be 
filled by an action record. The action-builder sequence 
terminates when every action on the action list has a SUCC 
attribute or is a LEAVE event, and all mobile entity types 
have been processed. 

Figure 4 is an overall flow diagram of the action-builder 
sequence. Another way to follow this procedure in more 
detail is to examine the ACTION LOOP portion of subroutine 
QUEST in Appendix D. 
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Special GPSS actions such as: 

TRANSFER ,FN7 (branch according to function 7) 

GATE NU 1 (hold here until facility 1 is not 

in use) 

can not be conveniently entered in the same manner as the 
action : 

"service customers from a queue, service time is 
exponentially distributed, mean=12 minutes." 

Hence the concept of a "conditional" was used. Conditionals 
require key-word entries from the terminal which activate 
specific decoding rules and cause unique questions or instruc- 
tions to be issued to the user. The implementation of 
conditionals is general and can be conveniently expanded 
both in QUEST (via the computed GO TO statement) and in the 
decoding rules under a unique checkpoint value (using the 
rule test: CHKPOINT (MEM) .EQ . 5 ). Initially, only the two 

conditionals mentioned above were implemented. 

The mobile entity sub-structure scheme described in 
Section III-C provides an additional capability while con- 
structing the action sequence. For example, in the case 
where the first mobile entity record represents 'SHIP' 
and has a CLASATR+ ' COLOR ' , the user would be prompted in the 
following manner (after the ARRIVE action record was 
completely built by the decoding rules): 
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• 

FOLLOWING THE ACTION: 

SHIP ARRIV AT PORT 

DOES ROUTING DEPEND ON COLOR OF SHIP ? 

<response> 

If the user answers 'YES’ then the decoding rules automa- 
tically create "function definition" records and a TEMPACTN 
record for each 'SHIP' mobile entity record in the sub- 
structure (each record with SUP-*-’ SKIP’ and a STRUC attri- 
bute). Function definition records are ultimately encoded 
by GES into GPSS functions that are used to assign a trans- 
action parameter, or determine transaction routing, etc. 

A 'YES' response would then be followed by: 

FOLLOWING THE ACTION: 

BLUE SHIP ARRIV AT PORT 
cCONDITIONAL option prompt> 

<response> 



as a result of a TEMPACTN record. In this case, the MBNTY 
of the TEMPACTN is the sub-structure 'SHIP' record with 
COLOR-*- ' BLUE ' . In the course of defining the next action the 
user would also be asked: 



WILL NEXT ACTION APPLY TO ALL SHIP 

VICE BLUE SHIP ONLY? 



<response> 
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This determines the MBNTY attribute for the resulting 
action record. If the answer is 'YES' the MBNTY attribute 
points to the original (top level) 'SHIP' record. Also, 
if the MBNTY of the action '‘UNLOAD' has a sub-structure, 
and the user is at the point of defining a duration for 
the action, the following question is asked: 



DOES DURATION OF UNLOAD DEPEND ON COLOR OF SHIP ? 
<response> 

A 'YES' answer would cause the decoding rules to create 
function definition records for the selection of" 'UNLOAD' 
time durations based on 'COLOR'. 

When the action-builder sequence is finished, the user 
is prompted to enter a value for total time to be simulated. 
The IDS is then complete and the computer prints: 

DATA STRUCTURE FOR PORT SIMULATION IS COMPLETE 
QUEST then terminates and user control is returned to the 
NLP command level. 

IV. THE QUESTION- ANSWER DECODING RULES 

NLP decoding rules are used to specify the processing 
of user responses needed to produce IDS records. General con- 
struction and application features of NLP decoding rules 
are discussed in Section II-D and in Appendix A. The pur- 
pose of this section is to describe the decoding rule 
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implementation for the question-answer decoding system. 
Appendix B contains a complete listing of these, decoding 
rules. 

A. GENERAL FEATURES 

As mentioned before, a decoding rule can communicate 
with the FORTRAN program via the special record MEM (an 
acceptable rule-language abbreviation of MEMORY).. This 
capability is used extensively in the question-answer de- 
coding system. The following are examples of its usage: 

( 1 ) " . . . , CHKPOINT ( MEM ) . LT . 5 a • . . " This is a condition 
test that allows further processing of this rule only if 
the value of the CHKPOINT attribute of MEM is less than 5- 
This rule execution control attribute is used frequently 
by QUEST. 

( 2 ) " . . . , ERR (MEM) =17 , . . . " This causes MEM to have an ERR 
attribute, if not already present, and assigns it a value 
of 17. Attribute values can be extracted and examined in 
the FORTRAN program by use of the HVAL function subroutine 
of NLP, if the SEGMENT record address is known (MEMORY 
is always known) and the attribute identification number 
is known (normally pre-defined, for the rules and FORTRAN, 
in the initial NLP machine configuration) . 

Another method of rule-FORTRAN communication is by 
the use of "ROUTINES.’ 1 When used, ROUTINE names and identi- 
fying numbers must be defined before the NLP Compiler proces- 
es the rules. While running the NLP Machine, if a ROUTINE 
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name is encountered in parentheses during execution of a 
rule, a set of FORTRAN statements is executed according to 
the identifying number associated with the ROUTINE. In the 
quest ion-answer decoding system, for example, the name 
ROK is defined as ROUTINE 21. The corresponding FORTRAN 
statements cause the logical variable OK to be set to "true." 
The state of OK is frequently tested by the control program 
QUEST. Appendix E is a listing of the ROUTINES that were 
written for the question-answer system. 

B. THE DECODING SCHEME 

In general, the question-answer decoding rules first 
parse the input string into SEGMENT records representing 
words and numbers, then into more comprehensive and specific 
SEGMENT records based on the type and relationship of the 
previous records. Many other conditions, such as CHKPOINT 
value, questionnaire phase, attributes and indicators in 
IDS records, etc., are used to determine the correct parse 
sequence for an input string. The goal is to appropriately 
add the information in an input string to the IDS. The 
question-answer decoding rules were designed to allow a 
great amount of flexibility in the input string by 
detecting key words or phrases. 

1 . Record Creation and Labeling 

Each time the decoding process is begun by calling 
DECODE from QUEST, the result is either the creation of 
a new semi-permanent IDS record or the modification of an 
existing IDS record. To complete an IDS record for, say. 



a mobile entity, the user must respond at least four times. 
Each' response completely re-initiates the decoding process, 
which presents the problem of remembering the address and 
status of the current mobile entity record and determining 
which rules to apply. The problem was solved in part by 
the use of the attribute CURSEG (current segment record) 
of MEMORY. When a mobile entity name is entered at the 
correct response point in QUEST, the basic IDS record is 
created and CURSEG(MEM) is set to point to this record. 

The rules applied for subsequent user responses have access 
to this record via CURSEG(MEM) and can test conditions, 
label, etc., accordingly. There are numerous examples of 
the use of CURSEG(MEM) in the rules of Appendix B. 

2 . Naming Entities, Actions, and Attributes 

The decoding rules allow for arbitrary naming of 
stationary entities, mobile entities, actions, and optional 
attributes. This is accomplished by a series of rules 
that pick up arbitrary character strings. For each segment 
of any input string that begins with an alphabetic charac- 
ter and is properly delimited by blank spaces or other al- 
lowable punctuation marks, a segment of type WORD is even- 
tually created with the SUP attribute set to the arbitrary 
name. Although the name may have up to 80 characters, the 
record for the SUP attribute will contain only the first 
eight characters. This general word accumulation process 
is accomplished with two ROUTINE'S. The first, RGETJ, stores 
the index of the first character of the WORD as the JNUM 
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attribute of the WORDP (word part) segment record that is 
initially created. When the tail end delimiting character 
(PUNCB) occurs, a WORD segment is created and the routine 
RSETSUP, using the value of JNUM, sets the SUP attribute 
as described above. In a sense, the system "learns" new 
words or names, and classifies them as "unknown" until their 
context is determined. Future' reference to any name must 
comply with its original usage or (for pre-entered names) 
pre-defined subset classification. Numerical values are 
assembled in a similar .manner although no ROUTINE'S are 
needed . 

3 • REPLY Segments 

The final SEGMENT TYPE that is formed when a satis- 
factory user response is decoded is the REPLY. When a 
REPLY is detected, or the end of the input string is sensed 
and no more decoding rules apply, a return to QUEST is 
effected and the user is prompted with a new question or 
instruction, or receives an appropriate error notification. 

4 . Error Detection Capability of the Rules 

In addition to the error detection capability men- 
tioned in Section III, some error message sequences are 
initiated by the decoding rules. This is accomplished by 
RERR (a ROUTINE) and the following rule language notation: 

xxxxx ( , ERR (MEM) = 17 ,RERR , ) 

The ROUTINE RERR extracts the value of the ERR attribute 
of MEMORY and calls the FORTRAN sub-program ERROR with that 
value (the error number) . RERR is normally used to notify 
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the user of illegal use of a previously defined name. ’BLUE' 
could not, for instance, be used as anything other than the 
value of a COLOR attribute. This type of error also causes 
the terminal to print the super-set (SUP) "chain" for the 
illegal name. (To examine or traverse the SUP "chain" 
means to look at the value of the SUP, then the value of the 
SUP of the SUP, then the value of the SUP of the SUP of the 
SUP, etc., until the desired value is located or the chain 
is exhausted.) In this example, if 'BLUE' were entered 
as a mobile entity name, the user would receive the following 
error message reply: 



BLUE "] 

$ f error message 

**INVALID MOBILE ENTITY NAME J 
BLUE *> ABSCOLRl 
ABSCOLR QUALVAL V SUP "chain" 

QUALVAL VAL J 

RE-ENTER NAME OF MOBILE ENTITY 

<response> 

The FORTRAN subroutine PRTCLS (included in Appendix D) is 
used to print the SUP "Chain." 

5 . Optional Attributes 

During the entry of each mobile and stationary 
entity, the user is given an opportunity to enter an un- 
limited number of optional attributes. At this point, any 
attribute not previously defined for that record may be 
entered. Each entry requires an attribute identification 
name and value (name or number) . Errors result when 
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pre-defined names (identification or value) are not 
correctly used. The following is an example- oT variations 
in user response that are acceptable: 

COLOR BLUE 
COLOR=BLUE 
COLOR is BLUE 

THE COLOR OF THE SHIP IS BLUE 

An example of an invalid optional attribute entry and error 
sequence is : 

ENTER OPTIONAL ATTRIBUTE FOR SHIP 
COLOR IS LARGE 

$ 

**ATTRIBUTE IS NOT IN CLASS INDICATED 
ENTER OPTIONAL ATTRIBUTE FOR SHIP 

<response> 

The applicable rules recognize that the pre-defined word 
LARGE is not a legal value of the attribute COLOR. How- 
ever, the rules will accept any "unknown" word as a value 
of the attribute COLOR and would set the SUP of the unknown 
word to ’ABSCOLR*. In this manner the system is capable 
of "learning" and classifying new names at virtually any 
point where name entries are required. 

6 . Action Generation 

As pointed out in Section III-D, the decoding rules 
search the action list for the oldest action description 
record which does not have a SUCC (successor) attribute. 

The attribute LASTACT(MEM) becomes a pointer to this action 
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record, and the MBNTY (mobile entity) attribute of the 
action record becomes the value of MBNTY(MEM). Using the 
action record LASTACT(MEM) , the rules then have access to 
the actual characters used by QUEST to print the line: 

BLUE SHIP APlRIV AT PORT 
These words are set as the values of the pre-specif led 
attributes A1 through A 5 of MEMORY, and are therefore avail- 
able at the FORTRAN level. This communication technique is 
the means by which the user is guided systematically through 
the question-answer session. A much more elegant communica- 
tion technique, however, would be to use the existing NLP 
Machine encoding capability and encode grammatically correct 
• English sentences. This would require an additional set 
of encoding rules, and remains an area of possible expansion 
of the basic question-answer system. 

The action 'generating rules create and assemble 
the action records, distribution records, "conditional" 
description records, etc., needed to describe a simulation 
problem in accordance with the IDS required by GES. All 
actions are prescribed by the user, but the strict question- 
answer format ensures that an encodable IDS will be generat- 
ed. In general the user can cause mobile entities to be 
processed by the system in any desired sequence, including 
routing to formerly defined actions. However, the system 
does not provide a check against user specification of an 
endless loop for a mobile entity. Additionally, the system 
does not allow the user to specify an arbitrary prot at ilit \ 
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distribution, but instead provides a choice (when appro- 
priate) among normal, exponential, and uniform, and asks 
him to provide the necessary parameters (mean, standard 
deviation, range). In lieu of a probability distribution, 
the user may specify a constant value of, for example, 
inter-arrival times of mobile entities. 

When the action list search determines that all 
action records are completed, the user is asked to supply 
the total time for which the simulation is to be run, then 
notified that the IDS representing the simulation problem 
is complete. At this point the user may elect to encode 
an English language description of his simulation problem 
(how the computer "sees" it) using the NLP encoding scheme 
developed by McGee [Ref. 6]. If this description Is satis- 
factory, he may then encode the GPSS simulation program 
using GES or XGES . 



V. SAMPLE PROBLEM 

Figure 5, extracted from Ref.5 s is an example of a 
queuing system simulation problem. Figure 6 shows a sug- 
gested form, to be prepared prior to using the question- 
answer system, that includes the minimum data needed to 
describe stationary and mobile entities. The form in 
Figure .6 is filled in according to the problem description 
in Figure 5* Figure 7 is a flow diagram of the same pro- 
blem. This diagram includes the responses (to questions or 



40 



There is a port containing a harbor, 3 docks, 

2 piers, a depot, and a barge. Ships arrive at 
the port with an interarrival time of 5 hours, 
uniformly distributed, with a range of ± 1 hour. 
50% of the ships are blue ships, 30% are red, 
and 20% are green. After a ship arrives at the 
port, it unloads cargo at any available dock. 

Each dock has a capacity of 1 unit . Each ship 
takes up 1 unit of capacity. Unloading time at 
the dock is normally distributed as follows: 

blue ship - mean of 5 hours, std dev of 1.5 hours 
red ship - mean of 4 hours, std dev of 1.0 hours 
green ship - mean of 3 hours, std dev of .5 hours 

After unloading at a dock, a blue ship unloads 
cargo at the barge, a red ship unloads cargo at 
the depot , and a green ship unloads cargo at a 
pier. The barge has a capacity of 1 unit, a pier 
has a capacity of 1 unit, the depot has a capacity 
of 4 units. Unloading times -are as follows: 

barge - 1.5 hours, exponentially distributed 
depot - 1 hour, exponentially distributed 
pier - 1 hour, normally distributed, std dev of 
15 minutes 

Next, after these latest unloadings, 40% of the 
ships load cargo at a dock, and the remainder wait 
in the harbor. Dock loading time is 2 hours for 
any ship. After loading cargo at a dock, a ship 
waits in the harbor. A ship waits in the harbor 
until the barge is unoccupied. After waiting in 
the harbor, a ship leaves the port. The basic 
time unit is the minute. Problem duration is 4 
days . 



Figure 5. Port Facility Simulation Problem 
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instructions) that the user will be expected to supply. A 
diagram of this sort is extremely helpful for effective use 
of the question-answer system. 

Figure 8 is an actual terminal session showing the 
computer questions and instructions, and the user responses 
for this sample problem. Figures 9 and 10 (extracted from 
Ref. 6) show the computer generated English description of 
this problem and the resulting G-PSS program. The output 
produced by this GPSS program may be found in Ref. 6. 



SYSTEM NAME : Port 



BASIC TIME UNIT: Minute 



TOTAL PROBLEM TIME: 5760min. 



STATIONARY ENTITIES: 



NAME 


QUANTITY 


CAPACITY 


OPTIONAL ATTRIBUTES 


IDNAME 


NAME/VALUE 


DOCK 


3 


1 


LOCATION 


PORT 


PIER 
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1 


Tf 


TT 


DEPOT 


1 


4 


tl 


TT 


BARGE 


1 


1 


ft 


ft 


HARBOR 
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UNLIMITED 


ft 


ft 

































MOBILE ENTITIES: 



NAME 


STRUCTURE 

LEVEL 


SERVERS 

NEEDED 


SUB-CLASS 

ATTRIBUTE 


LEVEL 2 
VALUE 


% 


OPT . 

ATTRIBTS 


Ship 


1 


1 


Color 


• 


100 




Shin 


2 


1 




Blue __ 


50 




ShiD 


2 


1 




Red 


3SL 




Ship 


2 


1 




Green 


20 


















• 















Figure 6. Stationary and Mobile Entity Data Form 
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SHIPS ARRIVE AT PORT 
TIME DIST.: UNIFORM 
MEAN = 300, RANGE ±- 60 



Blue 

BARGE ShiDS 



UNLOAD CARGO 
[EXPONENTIAL DIST 
MEAN=90 



DOCK 



ALL SHIPS 



UNLOAD CARGO 
DURATION DIST. : NORMAL 
Mean STDev 
Blue 300 90 

Red 2^0 60 

Green 180 30 



DEPOT 



Red Ships 



UNLOAD CARGO 
EXPONENTIAL DIST: 
MEAN=60 



Green 
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NORMAL DIST: 
Mean=60, StDev=15 



DOCK 
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LOAD CARGO 
Constant=120 



CONDITIONAL y 



CONDITIONAL 
UNTIL BARGE' 


,: "WAIT" 

NOT IN USE 


5 


f 


LEAVE THE PORT 
ALL SHIPS 



Figure 7. Problem Flow Diagram (Action Sequence) 
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Figure 8. Terminal Session for Port Facility Problem 
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Figure 8. (CONTINUED) 



ENTER OPTIONAL ATTRIBUTE FOR DEPOT (OR "NOME”) 
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ENTER OPTIONAL ATTRIBUTE FOR HARBOR (OR "NONE") 
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Figure 8. (CONTINUED 



CONSTRUCT ACTION SEQUENCE FOR PORT 

PROBABILITY DISTRIBUTIONS AVAILABLE: EXPONENTIAL, UNIFORM, NORMAL, CONSTANT 
ENTER TYPE OF DISTRIBUTION FOR TIME BETWEEN ARRIVALS OF SHIP AT THE PORT 
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CONTINUED 



NAME NEXT ACTION AND LOCATION FOR SHIP 
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GIVE MEAN OF EXPONENTIAL DIST. FOR DURATION OF UNLOAD RED SHIP 
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OTHER THAN BLUE SHIP PASS THROUGH SPLIT1 



ENTER PERCENT OF EACH BRANCH FROM SPLIT1 
EXAMPLE: 30,30,40 
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Figure 8. (CONTINUED) 



DOES DURATION OF LOAD DEPEND ON COLOR 
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Figure 8. (CONTINUED) 
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FIGURE 9, ENCODED ENGLISH TEXT OF PORT FACILITY PROBLEM 
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FIGURE 10. GPSS REPRESENTATION OF PORT FACILITY PROBLEM 
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FIGURE 10* (CONTINUED) 



VI . CONCLUSIONS AND RECOMMENDATIONS 



The question-answer decoding system provides a con- 
venient method for expressing simple queuing problems and 
producing the corresponding data structures that enable 
GES or XGES to encode a GPSS simulation program. The user 
of this system should know the general concepts of com- 
puter simulation, including how to specify and apply pro- 
bability distributions, and how to run a GPSS program and a 
analyze the results it produces. A knowledge of GPSS pro- 
gramming is helpful but not essential. The question-answer 
decoding system and the encoding systems described in Refs. 

5 and 6 demonstrate the tremendous potential of NLP for 
language processing. 

A. GENERAL RESULTS 

During the development of the quest ion-answer decoding 
system, a number of techniques were devised which should 
be useful in future work with NLP. Among them are: 

(1) A scheme for collecting, processing and classifying 
words (usually names) that have not been previously defined. 

(2) A convenient method of communicating information 
from the data structure to the user while in the decoding 
mode of operation. 

(3) The "action-list/successor-attribute" method of 
developing the action sequence. 
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(4) The "temporary action" scheme to represent actions 
that are imminent but must be defined or identified on 
later cycles. 

(5) The overall logic for simulation problem definition 
that allows an encodable IDS to be generated. 

B . FUTURE DEVELOPMENT 

There is considerable potential for expansion and 
future development of the question-answer decoding system. 
Three areas in which work could be done immediately are: 

(1) Expansion of system capability to include such 
things as shortest line choices, queue length or other con- 
dition tests, user specification of GPSS output statistics, 
and increased flexibility in user input choices. 

(2) Development of encoding rules to produce English 
sentences at the terminal during the question-answer session. 

(3) Inclusion of the capability to back up and modify 
something previously completed, such as an action record, 
without having to re-construct the entire problem. 
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APPENDIX A 



NLP DECODING RULE SYMBOLOGY AND IMPLEMENTATION 



Appendix B of Ref. 5 describes NLP encoding rule symbo- 
logy and usage. The items listed therein which apply to NLP 
decoding rules have been duplicated as numbers 7 through 28 
below. Other items below pertain specifically to the NLP 
decoding process . 

(1) The decoding rules are automatically initiated with 
the processing of a period (".") followed by a blank (" ") 
immediately ahead of the terminal input string. 

(2) The first character is then read from the input string, 
and all rules that begin with or are now waiting (as a 
result of the period and blank) for that character (as a 
condition on the left) become or remain "active." 



(3) Any active rule for which all conditions on the left 
have been tested is said to be "satisfied," and the SEG- 
MENT’S indicated on the right are created. Each newly 
created SEGMENT is treated in a manner similar, with respect 
to rule processing, to a character from the input string, 
and can cause additional rules to become active or be satis- 
fied. This process continues until all possible SEGMENT'S 
have been created and no further rule testing is applicable. 
Next, all rules which were ’waiting for a character other 
than the last input character become inactive, and the 
next character of the input string is read. 



(^) When the input string is exhausted, or (in the case of 
the question-answer decoding system) a REPLY segment is 
created, the decoding process is terminated. 

(5) When decoding is terminated, all rules become inactive. 
Also, all SEGMENT records that have not become part of the 
IDS are erased and their component cells are returned to a 
free cell list for future use. 



(6) When a particular rule is satisfied, the SEGMENT or 
SEGMENT'S that appear on the left are not lost (as in the 
encoding process) and remain available for other rules 
until decoding is complete. Thus the concept of "parallel" 
processing is introduced. 
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(7) An individual rule is processed from left to right. 

The portion of the rule to the left of the transformation 
symbol (-->) is called the "condition" of the rule and is 
used to test for the applicability of the rule. The portion 
of the rule to the right of the transformation symbol is 
called "labeling" and is used to designate the method and 
extent of new record construction. 

(8) A record is explicitly referenced by designating the 
record name (i.e. SEGMENT TYPE), or, if the record to be 
referenced is the one currently being tested or constructed, 
by designating the record name RECORD, SEGMENT, or SEG. 

(9) If the record to be referenced is the one currently 
being tested or constructed, it may be implicitly designated 
by default. 

(10) An attribute may be referenced either by name or by 
number. The number designation range of legal values is 
from 11 to 300 and is specified by preceeding the number 
with the symbol "@". 

(11) MEMORY is a unique record in that no copies are made 

of it. All modifications of MEMORY are made on the original 

(12) Since the rules seldom process original records, the 
primary w ay to add. delete, or change attribute information 
in an original record is to do so via a MEMORY designation. 

(13) If a record to be constructed is to be of the same 
SEGMENT TYPE as the condition record, is to be a copy of the 
condition record, and only one such record is to be con- 
structed, then only the SEGMENT TYPE need be listed to 
accomplish the construction. 

(1*0 — > is called the transformation symbol and means 
"construct the designated records which follow." 

(15) (_ means "of",*e.g. SUCC (ACT) mean "successor of action 

(16) 2. and j_ are used to assist in reading the rules. 

(17) $Q means "test attribute number Q through successive 
records possessing attribute Q where Q is an integer and 
attribute Q points to a record that may possess attribute 
Q. Non-designation of Q will default to 1, signifying the 
SUP attribute. 

(18) .EQ. , .NE . , . GT . , .LT. , LE . , and . GE. have the same 
meaning as in FORTRAN and are used for the testing of rule 
conditions . 

(19) = has the same meaning as in FORTRAN. 



o3 



(20) means "not" and is used in the testing of. rule 
conditions . 

(21) - if not used in an arithmetic expression, means "erase 
the designated attribute", or turn off the designated 
indicator . 

(22) + , -, and / have the standard arithmetic operator 
meanings. Note that they must be used with attributes of 
TYPE 0 cell construction and that no more than one operation 
may be used in an arithmetic expression. 

(23) * XXXX ' when appearing to the left of the transforma- 
tion symbol, means "test to determine if the first SUP 
attribute of this record points to XXXX." 

(24) 1 XXXX ' when appearing to the right of the transforma- 
tion symbol, means "assign a pointer to XXXX as the value 
of the first SUP attribute of this record." 

(25) " ZZZ " denotes the character (EBCDIC) representation 
ZZZ. These double quotes are usually used with an attribute 
of TYPE 1 cell construction. 

(26) 1 means "or" and applies only to complete test condi- 
tions. Also, this symbol must be used for the rightmost 
test conditions as the rule condition is satisfied whenever 
any one of the test conditions separated by 1 is satisfied, 
e.g. BLOK(SUCC.EG. AGENT, GOAL. EQ. AGENT | ’TRANSFER’). 

(27) @Q means "attribute number Q" where Q is either an 
integer or an attribute of TYPE .0 cell construction. 

(28) %_ means "make a copy of the designated record to become 
the structure of this record." 

(29) # represents blank space in the input string. 
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ATTRIBUTES, ROUTINES, INDICATORS, AND NAMED RECORDS FOR RULES 
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WRITE ( WTERM » 10 1 ) < CCL ( I ) » 1=1 t 80 ) 
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RULE-CALLABLE FORTRAN ROUTINES 
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COHN) = BLANK 
J = HVA L ( A JNUM 7 SEG MNT ) 

CALL C 0 L E C T ( DWORD) 

50 41 CALL SE T SUP ( C WORD t DWORD T ZERO 7 S EGMNT ? J ) 
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